Norwegian University of Science and Technology Technical Report IDI-TR-1/2003 Algorithms for Granularity Reduction in Temporal Document Databases

نویسنده

  • Kjetil Nørvåg
چکیده

With rapidly decreasing storage costs temporal document databases is now a viable solution in many contexts. However, storing an ever growing database can still be too costly, and as a consequence it is desirable to be able to physically delete old versions. Traditionally, this has been performed by an operation called vacuuming, where the oldest versions are physically deleted (or migrated from secondary storage to cheaper tertiary storage). However, in temporal document databases it is more appropriate to remove intermediate versions instead of removing the oldest versions. We call this operation granularity reduction. In this paper we describe six approaches to granularity reduction, and discuss advantages and disadvantages of these approaches. Three of the approaches have been implemented into the V2 temporal document database system, and in this context we discuss the cost of applying the approaches.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Norwegian University of Science and Technology Technical report IDI-TR-11/2002 Supporting Temporal Text-Containment Queries

In temporal document databases and temporal XML databases, temporal text-containment queries are a potential performance bottleneck. In this paper we describe how to manage documents and index structures in such databases in way that makes temporal text-containment querying feasible. We describe and discuss different index structures that can improve such queries. Three of the alternatives have...

متن کامل

Norwegian University of Science and Technology Technical report IDI-TR-X/2002, last revised: 2002-09-02 V2: A Database Approach to Temporal Document Management

The advent of large amounts of data on the web has closed the gap between the document storage and database communities. In this paper, this work is continued by the description of the foundations for temporal document databases. We describe the V2 temporal document database, which supports storage, retrieval, and querying of temporal documents. We describe functionality and operations/operator...

متن کامل

Norwegian University of Science and Technology Technical report IDI-TR-10/2002 Design, Implementation, and Performance of the V2 Temporal Document Database System

The advent of large amounts of data on the web has closed the gap between the document storage and the database communities. In this paper, this work is continued by the description of the foundations for temporal document databases. We describe functionality and operations/operators to be supported by such systems, and more specifically we describe the architecture for management of temporal d...

متن کامل

Norwegian University of Science and Technology Technical Report IDI-TR-09/2007 Semantic-Based Association Rule Mining of Temporal Document Collections

In many contexts today we have documents available in a number of versions. In addition to explicit knowledge that can be queried/searched in documents, these documents also contain implicit knowledge that can be found by text mining. In this paper we will study association rule mining of temporal document collections, and extend our previous work by 1) performing mining based on semantics as w...

متن کامل

Norwegian University of Science and Technology Technical Report IDI-TR-05/2008 PROQID: Partial restarts of queries in distributed databases

In a number of application areas, distributed database systems can be used to provide persistent storage of data while providing efficient access for both local and remote data. With an increasing number of sites (computers) involved in a query, the probability of failure at query time increases. Recovery has previously only focused on database updates while query failures have been handled by ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003